Reduction to Condensed Forms for Symmetric Eigenvalue Problems on Multi-core Architectures

نویسندگان

  • Paolo Bientinesi
  • Francisco D. Igual
  • Daniel Kressner
  • Enrique S. Quintana-Ortí
چکیده

We investigate the performance of the routines in LAPACK and the Successive Band Reduction (SBR) toolbox for the reduction of a dense matrix to tridiagonal form, a crucial preprocessing stage in the solution of the symmetric eigenvalue problem. The target architecture is a current general purpose multi-core processor, where parallelism is extracted using a tuned multi-threaded implementation of BLAS. Also, in response to the advances of hardware accelerators, we modify the code in SBR to accelerate the computation by off-loading a significant part of the operations to a graphics processor (GPU). Our results on a system with two Intel QuadCore processors and a Tesla C1060 GPU illustrate the performance and scalability delivered by these architectures.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Condensed forms for the symmetric eigenvalue problem on multi-threaded architectures

We investigate the performance of the routines in LAPACK and the Successive Band Reduction (SBR) toolbox for the reduction of a dense matrix to tridiagonal form, a crucial preprocessing stage in the solution of the symmetric eigenvalue problem, on generalpurpose multi-core processors. In response to the advances of hardware accelerators, we also modify the code in the SBR toolbox to accelerate ...

متن کامل

Solution of eigenvalue problems on heterogeneous computing architectures

In this paper are presented current achievements and the state-of-the-art algorithms and implementations for dense linear algebra on traditional architectures such as single-core machines or distributed memory parallel machines. Also, this paper summarizes the current implementations and publicly available libraries for basic linear algebra for multi-core and many-core architectures (e.g. graph...

متن کامل

Accelerating the reduction to upper Hessenberg, tridiagonal, and bidiagonal forms through hybrid GPU-based computing

We present a Hessenberg reduction (HR) algorithm for hybrid systems of homogeneous multicore with GPU accelerators that can exceed 25× the performance of the corresponding LAPACK algorithm running on current homogeneous multicores. This enormous acceleration is due to proper matching of algorithmic requirements to architectural strengths of the system’s hybrid components. The results described ...

متن کامل

Accelerating the reduction to upper Hessenberg form through hybrid GPU-based computing

We present a Hessenberg reduction (HR) algorithm for hybrid multicore + GPU systems that gets more than 16× performance improvement over the current LAPACK algorithm running just on current multicores (in double precision arithmetic). This enormous acceleration is due to proper matching of algorithmic requirements to architectural strengths of the hybrid components. The reduction itself is an i...

متن کامل

MR3-SMP: A symmetric tridiagonal eigensolver for multi-core architectures

The computation of eigenvalues and eigenvectors of symmetric tridiagonal matrices arises frequently in applications; often as one of the steps in the solution of Hermitian and symmetric eigenproblems. While several accurate and efficient methods for the triadiagonal eigenproblem exist, their corresponding implementations usually target uni-processors or large distributed memory systems. Our new...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009